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ABSTRACT 

The  problem  of  attitude  control  of  a  body  described  by  die  equation 
I0(t)  3  T(t)  is  examined,  where  I  is  the  moment  of  inertia,  0(t)  is  the 
angular  displacement,  and  r(t)  is  the  torque  exerted  by  the  reaction  jets. 
There  exists  a  magnitude  restriction  on  the  torque,  i.  e. ,  |  r(t)|  =»  M  and, 
in  addition,  0(t)  lies  in  the  interval  -  ir  s  0(t)  s  ir.  Under  these  restrictions 
the  time -optimal  and  the  fuel -optimal  control  laws  are  derived  and  the 
design  of  the  corresponding  optimal  controllers  is  presented. 
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1.  INTRODUCTION 

Second  order  systems  described  by  a  differential  equation  of  the  form 
y(t)  =  f(t),  |f(t)|  s  1,  have  received  considerable  attention  by  various  investigators. 

The  time-optimal  control  of  such  systems  is  treated  in  Refs.  1,2,3,  and  4.  Recently, 
the  fuel -optimal  control  of  such  systems  has  been  considered.  Reference  5  contains 
the  elements  of  fuel -optimal  control  with  unrestricted  response  time.  References  6 
and  7  deal  with  the  fuel -optimal  control  where  the  response  time  is  fixed  a  priori. 
Reference  8  contains  the  techniques  of  fuel -optimal  control  with  an  upper  bound  on  the 
response  time. 

A  common  characteristic  of  the  results  available  to  date  is  the  assumption  that 
the  (output)  variable  y(t)  can  be  measured  and  that  its  values  lie  in  the  interval 
-*  <  y(t)  <  +«.  If  such  Is  the  case,  then  the  problem  of  forcing  y(t)  and  y(t)  to  zer. 
in  minimum  time  is  solved  and  the  solution  is  well  known:  one  divides  the  phase  plane 
into  two  regions  by  means  of  the  "switch"  curve  such  that  in  one  side  f(t)  *  +1  and  on 
the  other  f(t)  *  -1.  Similarly,  for  the  fuel -optimal  problem  the  phase  plane  is  divided 
into  three  regions  such  that  in  the  first  region  f(t)  =  +1,  in  the  second  f(t)  =  0,  and  in 
the  third  f(t)  =  -1.  Moreover,  one  can  design  simple  nonlinear  feedback  controllers 
for  minimum -time  and  minimum -fuel  control.  It  must  be  emphasized  that  these  results 
are  valid  if  and  only  if  -  •  <  y(t)  <  «. 

In  this  paper  the  attitude  control  problem  of  a  system  described  by  the 
differential  equation  I  0  (t)  *  r(t),  |r(t)|  ^  M,  -its  0(t)  s  ir  is  considered.  Note  that 
an  additional  restriction  on  the  values  of  the  angular  displacement  variable  0(t)  is 
imposed.  It  will  be  shown  that  this  restriction  on  0(t),  which  is  the  result  of  the 
inability  of  the  measuring  instruments  to  distinguish  between  an  angle  0(t)  and  an  angle 
0(t)  +  2nir ,  changes  significantly  both  the  time-optimal  and  the  fuel -optimal  control 
laws.  Engineering  realizations  of  the  optimal  control  laws  will  also  be  given  and  their 
operation  will  be  explained. 


1 


2.  THE  PHYSICA  L  SYSTEM 


Consider  a  body  rotating  in  a  plane  as  illustrated  in  Fig.  1.  The  rotation  of  the 
body  can  be  controlled  by  the  reaction  jets  which  provide  a  thrust  f(t).  Let  I  denote 
the  moment  of  inertia  of  the  body  about  an  axis  perpendicular  to  the  plane  of  rotation 
and  passing  through  the  center  of  mass.  Let  r(t)  be  the  torque  produced  by  the 
reaction  jets,  i.  e. 

r(t)  =  c  f(t)  .  (1) 


In  the  absence  of  friction,  the  differential  equation  of  the  rotating  body  is 

Ifl(t)  =  T(t)  .  (2) 

Since  the  measuring  instruments  cannot  distinguish  between  an  angle  0(t)  and  an  angle 
0(t)  +  2nir ,  n  *  1, 2, ... ,  the  angular  position  0(t)  is  restricted  by 

-  ir  a  0(t)  a  »  (3) 

The  reaction  jets  deliver  a  thrust  f(t)  which  is  bounded,  i.  e. 

|f(t)|  s  F  (4) 


Define  u(t)  by 

*0-^-  .  (5) 

Clearly,  |u(t)j  a  M,  where  M  =  c  F/I.  For  the  sake  of  simplicity  let  M  =  1  and  so 
the  magnitude  restrictions  on  the  control  variable  u(t)  are 

|u(t)|  a  l  (6) 


for  all  t.  Define  the  state  variables  x^(t)  and  x2<t)  by 

x^t)  =  0(t)  ;  x2(t)  =  0'(t) 

Then  the  x^(t)  and  x^t)  state  variables  satisfy  the  differential  equations 

Xj(t)  =  x2(t) 
x2(t)  =  u(t) 


(7) 


(8) 


2 


t 

! 

i 


where  Xj(t)  is  restricted  by 


for  all  t. 


-V  *  Xj(t)  ^  +7T 


3.  THE  TIME -OPTIMAL  PROBLEM 


The  minimum -time  control  problem  for  the  rotating  body  is: 

Given  the  system  described  by  Eq.  (8)  with  the  control  u(t)  restricted 
by  Eq.  (6)  and  with  the  x^t)  variable  restricted  by  Eq.  (9),  determine 
the  control  which  will  force  any  state  to  the  zero  state  in  minimum  time. 

It  is  well  known  that  for  a  second  order  system,  such  as  the  rotating  body 
considered  here,  the  time-optimal  control  u(t)  is  either  +1  or  -1  and  it  may  switch 
from  +1  to  -1  or  from  -1  to  +1  at  most  once  (see  for  example  Ref.  4).  It  is  desired 
to  find  the  value  of  the  control  for  every  possible  state  (Xj,  x^)  where  -tr  s  +7r. 

The  method  of  attack  is  as  follows:  A  family  of  curves  will  be  defined  which 
will  divide  the  state  plane  (l.e.  the  strip  of  the  x^  -  x  plane  defined  by 
-ir  s  Xj  s  +ir,  -•  <  x2  <  *° )  into  several  regions.  For  each  point  in  the  state  plane  the 
value  of  the  control  which  will  provide  the  solution  to  the  time -optimal  problem  will  be 
stated.  The  elements  of  the  proof  of  the  time -optimality  will  follow. 

Definition  1.  Consider  the  family  of  curves  in  the  state  plane  described  by  the 
equation 

*i  =  nit  -|x2lx2!  (10) 

where 

n  =  0,  +1,  +2,...  (11) 

and 

-ff  s  £  +r  .  (12) 

A  state  (Xj,x2)  which,  for  a  particular  value  of  n,  satisfies  Eq.  (10)  is  said  to  belong 

to  the  y  curve, 
n 

Several  of  the  curves  are  shown  in  Fig.  2.  Some  properties  of  the  yn  curves 
will  now  be  developed. 

Let  u(t)  =  A  *  +  1  in  Eq.  (8)  and  solve  for  x^(t)  and  x^(t)  using  the  initial 
conditions 

XjW-tj  ;  x2(0)M2  (13) 
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The  state  variables  as  functions  of  time  are  found  to  be 


xi<t) » +  42t  +  j^t2 
x2<t)  ■  4  2  +  At 


(14) 


Eliminating  the  time  t  in  the  above  equations  one  finds  the  equation  of  a  trajectory  in 
the  state  plane 

:  (15) 

which  joins  the  state  (4  ^  4  2)  with  the  state  (x^,  x2)  when  the  control  is  u(t)  =  A  =  +  1. 

Consider  the  portion  of  the  y curve  above  die  x^  axis,  which  is  labeled  yq_ 
in  Fig.  2.  The  equation  of  the  yQ_  curve  is  obtained  from  Eq.  (9)  using  n  =  0  and 
x2  >  0;  therefore 

xi  =  -jx2  .  (16) 


Now  in  Eq.  (15)  let  A  =  -1  and  Xj  =  x2  =  0  to  find 

4  =  -  -42 

S1  2  s2 


(17) 


Equations  (16)  and  (17)  are  identical  and  one  concludes  that  every  point  on  the  y^_  curve 
can  be  forced  to  (0,0)  using  u(t)  =  -1.  Similarly,  one  finds  that  every  point  on  the 
curve  (which  is  the  portion  of  the  Yq  curve  below  the  Xj  axis)  can  be  forced  to  (0, 0) 
using  u(t)  =  +1,  As  a  matter  of  fact  the  yQ  curve  is  the  portion  of  the  switch  curve  (see 
Refs.  1,2,3, 4)  in  die  strip  defined  by  -ir  £  Xj  s  +tt . 

Now  consider  the  y2  curve  whose  equation  is 

xl32ff-£x2  (18) 

obtained  from  Eq.  (9)  using  n  =  2  and  n2  >  0.  The  y^  curve  hits  the  line  x^  =  ir  at  the 
point  (ir,*/2 ff).  Now  consider  the  equation  of  the  trajectory  obtained  from  Eq.  (15)  with 
A  =  -1,  Xj  =  ir  and  x2  =  *12 V .  This  trajectory  has  the  equation 


(19) 


Equations  (18)  and  (19)  are  identical  and  one  concludes  that  every  point  on  the  y^  curve 
can  be  forced  to  the  point  (ir,N/2ir" )  using  u(t)  ±  -1.  However,  die  points  (-jt,*/2F ) 
and  (jt.n/IF )  are  equivalent  and  one  may  conclude  that  every  point  on  the  y^  curve 
can  be  forced  to  (0,0)  using  u(t)  =  -1  and  that  the  path  is  the  y^  curve  until  the  point 
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(n,  \I2tT )  is  reached  and  then  the  Yq_  curve.  Using  similar  reasoning  one  establishes 
the  following  prepositions. 

Preposition  1.  Any  point  on  the  yo_,  y 2>  y 6 .  curves  can  be  forced  to 

(0, 0)  using  u(t)  =  -1  and  the  path  followed  consists  of  the  y  ,  y^,  y^,  y6, ....  curves. 

Preposition  2.  Any  point  on  the  yof,y_2.y.4,y_6 .  curves  can  be  forced 

to  (0, 0)  using  u(t)  =  +1  and  the  path  followed  consists  of  the  y  ,  y_2,  y_4>  y  ^ . 

curves. 

These  two  prepositions  establish  properties  of  the  yQ  curves  for  even  values  of 


n. 

Definition  2.  The  statement  "apply  first  the  control  u1  and  then  the  control 
u  , . . . "  will  be  abbreviated  by  "apply  the  control  sequence  {u  ,  u 

The  properties  of  the  yQ  curves,  for  n  odd,  will  now  be  derived.  Consider  a 
state  X  =«  (x. ,  x„)  in  the  region  between  the  y  curve  and  the  y.  curve  as  Illustrated  in 
Fig.  3.  There  are  two  ways  of  reaching  the  origin  (0, 0)  using  a  control  which  switches 
only  once.  One  way  is  to  apply  the  control  sequence  {-l,+l},  i.e.  to  apply  u(t)  =  -1 
until  the  trajectory  reaches  the  Y  curve  at  die  point  Y  and  then  apply  u(t)  =  +1  to 

Chr 

reach  the  origin.  Let  the  time  required  by  the  control  sequence  {-l,+l}  be  Tj.  The 
trajectory  resulting  from  the  sequence  {-l,+l}  is  XYO  in  Fig.  3.  The  other  way  is 
to  apply  the  control  sequence  {+1, -l},  i.e.  apply  u(t)  =  +1  until  the  trajectory  hits 
the  y2  curve  at  the  point  Z  and  then  apply  u(t)  =  -1  to  reach  (0,0)  along  the  y^  and 
Yq_  curves  (cf  Preposition  1).  The  trajectory  resulting  from  die  control  sequence 
{+l,-l}  is  XZAA'O  in  Fig.  3  and  denote  the  time  required  by  T2* 

It  is  easy  to  evaluate  die  times  Tj  and  T2  required  by  the  two  methods  of 
control,  using  Eq.  (14).  One  finds 

Tj-x2-2y2  (20) 

and 


But  from  Eqs.  (16)  and  (IS) 


(21) 


(22) 
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and  from  Eqs.  (18)  and  (15) 


z2  *  n/-Xj  +  ix2  +  2*  .  (23) 

Substituting  Eq.  (22)  Into  Eq.  (20)  one  finds 

Tx  =  x2  *  2  Vxj  +  i  x2  (24) 

and  from  Eqs.  (23)  and  (21) 

T2  *  2  V-Xj  +  i  x2  +  2*  -x2  .  (25) 

Now  consider  the  set  of  states  (x^  x2)  with  die  property  that 

T1  =  T2  .  (26) 

Such  states  must  satisfy  the  equation 

x2  -  2  Jx1  +  |x2  =  2  'J‘Xl  +  i  x2  +  2ir  -  x2  (27) 

which  simplifies  to,  after  a  series  of  algebraic  manipulations, 

X1  =  *  *  Jx2  (28) 

From  Eq.  (10),  setting  n  =  1  and  x2  >  0,  the  equation  of  the  y^  curve  is 

xi  “  *  ■  i*2  *  (29* 

Equations  (28)  and  (29)  are  identical  and,  so,  one  immediately  concludes  that  the  y^ 
curve  is  a  locus  of  points  in  the  state  plane  which  require  die  same  time  to  reach  the 
origin  using  either  the  control  sequence  {+1,  -l},  or  die  control  sequence  {-l,+l}. 

Using  identical  reasoning  one  can  compute  die  locus  of  all  points  in  the  state 
plane  which  require  die  same  time  to  reach  die  origin  using  either  the  control  sequence 
{+1,  -l}  or  {-l,+l}.  For  example,  if  one  chooses  an  initial  state  in  the  region  between 
the  ?2  and  y^  curves,  one  will  find  that  the  y^  curve  is  die  locus  of  points  which 
require  the  same  time  to  reach  die  origin  using  either  the  control  sequence  {+1,  -l}  or 
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{-l,+l}.  Thus,  one  can  establish  the  following  preposition. 

Preposition  3.  Any  point  on  the  curves  for  odd  values  of  n  can  be  forced  to 

(0,0)  using  either  the  control  sequence  {+1,  -l}  or  the  control  sequence  {-l,+l}  and 

requiring  the  same  response  time  in  either  case. 

Definition  3.  Let  R  , ,  denote  the  region  which  lies  between  the  y  and 
-  m,  m+i  m 

y  curve.  More  precisely,  the  state  (x,,x.)  belong  to  the  region  (or  set)  R  , . 
m+l  m,  m+i 


and  if 


-ir  £  Xj  ^  +ir  I 

2(mir  -  xp  <  x2|x2|  <  2((m+l)ir  -  Xj) 


(30) 


for  m  =  0,  +1,  +  2, . . . , .  For  example,  the  region  R  „  is  the  region  between  the 
-  - 

y  2  and  the  y  ^  curve. 

The  regions  R  ,  are  illustrated  in  Fig.  2. 
m,  m+l 

Theorem  1.  Given  a  state  (Xj.x^  in  the  state  plane.  The  time -optimal  control, 
that  is  the  control  which  will  force  any  state  (Xj.x^  to  (0, 0)  in  the  shortest  possible 
time  is 

a  (x1,x2),Rm  m+r  then  u  -  (-l)m+1. 

If  (x,,x.)ey  and  if  m  =  2,4,6, .. .,  thenu=-l. 
i  i  m 

If  (x.,x_)ey  and  if  m  = -2, -4, -6, . . . ,  thenu  =  +l. 
i  i  m 

If  (x1>x2)f  yQ_,  then  u  =  -1. 

If  (x1»x2)eY0+>  *en  u  =  +l. 

If  (x1,x2)€ym  for  m  odd,  then  u  =  -1  or  u  *  +1  (the  time  response  is  the 

same). 

The  above  theorem  provides  the  value  of  the  time  -optimal  control  for  each  state 
in  the  state  plane.  The  proof  of  Theorem  1  is  given  in  Appendix  1.  The  system  shown  in 
block  diagram  form  in  Fig.  4  represents  an  engineering  realization  of  the  control  law 
given  by  the  theorem  and  it  will  result  in  the  minimum  response  time  control  of  the 
rotating  body. 

The  design  and  operation  of  die  system  of  Fig.  4  is  based  on  die  following 
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reasoning.  Define  a  function  hR  by 

hn  =  Xl  ‘n7r+2X2'X2!  *  (31) 

Note  that  if 

h  =  0  (32) 

n  ' 

then  the  state  (Xj ,  x^)  belongs  to  the  Yn  curve  (see  Definition  1).  If 

h  >  0  (33) 

n 

then  the  state  (x^,  x2)  is  above  the  y^  curve  and,  finally,  if 

h  <  0  (34) 

n 

then  the  state  (Xj,x„)  is  below  the  yR  curve.  As  indicated  in  Fig.  4,  die  signals  hn 

are  generated  from  the  x^  and  x2  state  variables  and  a  constant  source  providing  the 

signal  ir.  Hie  only  nonlinearity  required  is  of  die  square  law  type,  indicated  by  the 

block  N  in  Fig.  4.  The  nonlinearity  N  is  such  that  its  output  signal  is  ix2|x2|  when 

the  input  signal  is  x2<  From  the  signal  hQ  the  system  generates  all  other  signals  hn 

by  repeated  addition  or  subtraction  of  it . 

Suppose  that  the  measured  state  (x,,x„)  is  in  the  region  R  , ,,  that  is  the 

l  i  m,  m+i 

state  lies  between  the  y  and  y  , ,  curves.  Then  the  signals  h  will  be: 

m  m+i  n 

> 

h  >  0  for  all  |im 

(35) 

hi  <  0  for  all  i  s  m+1 

<< 

For  clarity,  suppose  that  the  signals  h„,  h„ ,  h .  are  negative  and  that  die  signals 

hj, h^  h  j, h_2 .  are  positive,  as  illustrated  in  Fig.  4.  Then  die  measured  state 

point  (Xj,x2)  is  between  the  y^  and  y2  curve,  i.  e.  it  belongs  to  the  Region  R^  ^ 

As  shown  in  Fig.  4  each  h.  signal  activates  a  polarized  relay  with  outputs  +1  and  -1. 

Let  q^  be  the  output  of  the  itfl  relay,  i.  e.  q^,  =  sgn  {h^  =  +  1.  For  example,  suppose 

that  in  Fig.  4  q2  -  q^  =  q4  =  .. .  =  -1  and  q1  =  qg  =  q^  =  q_2  =  . . .  =  +1.  Connect  the 

coils  of  normally  open  relays  r  between  adjacent  signals  a  and  q  .,  where  the 

1  i+1  ^ 

contact  of  the  r^  relay  is  connected  to  a  voltage  (-1)  and  the  outputs  of  all  die  relays 
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are  connected  to  a  common  terminal.  Only  one  of  the  relays  r^  will  be  activated.  For 
example,  in  the  specific  situation  discussed  above  only  the  relay  r^  will  be  closed  and 
all  the  other  relays  will  remain  open,  because  there  exists  a  voltage  difference  only 
across  the  coil  of  the  relay  r^  and  there  is  no  voltage  drop  across  the  coils  of  all  the 
other  relays.  The  signal  at  the  common  terminal  will  have  the  correct  polarity  to 
activate  the  relay -like  mechanism  of  the  gas  jets  which  will  now  produce  the  correct 
thrust  to  drive  9  and  9  to  zero  in  minimum  time. 

In  any  physical  situation  one  can  expect  a  bound  on  the  magnitude  of  the  angular 
velocity  x^l  therefore  the  nonlinear  computer  of  Fig.  4  would  be  terminated  at  the 
required  number  of  stages. 


4.  THE  FUEL- OPTIMAL  PROBLEM 


The  minimum -fuel  attitude  control  problem  for  the  rotating  body  is  to  do.ive 
fl(t)  and  0(t)  to  zero  using  as  little  gas  (or  fuel)  as  possible.  More  precisely:  '  “ 
Given  the  system  described  by  Eq.  (8)  with  the  control  u(t)  restricted 
in  magnitude  by  Eq.  (6)  and  with  the  x^(t)  state  variable  restricted  by 
Eq.  (9),  determine  the  control  which  will  force  any  state  to  the  zero 
state  with  minimum  consumed  fuel  F  measured  by 


where  T  is  die  response  time  (unspecified)  and  k  is  a  positive  constant 
of  proportionality.  Furthermore,  if  more  than  one  control  requires  the 
same  minimum  fuel,  then  find  the  fuel -optimal  control  which  requires 
the  least  response  time. 

It  is  well  known  (see  Refs.  5, 6,7,8)  that  for  a  second  order  system,  such  as  the 
rotating  body  considered  in  this  paper,  the  fuel -optimal  control  u(t)  is  either  +1,  0,  or 
-1  and  that  the  only  control  sequences  (see  Definition  2)  that  may  be  used  are  {o},  (fl}, 
{-l}.  frl.O},  {-1,0},  {0,+l},  {0, -l},  {+1,0,  -1}  and  {-1,0, +l}. 

Consider  the  family  of  the  curves  (see  Definition  1)  described  by 

x1  =  nff  -  JX2U2|  (37) 

where 

n  =  0,  +  2,  +  4,  +6,  ...  ,  (38) 

and 

-  7T  S  Xj  s  +  7T  .  (39) 

This  family  of  Yn  curves  (n  even)  is  shown  in  Fig.  5. 

The  fuel  -optimal  control  law  is  provided  by  the  theorem  below. 

Theorem  2.  Given  a  state  (Xj^)  in  the  state  plane. 

If  (x.,x0)  belongs  to  the  y  curve  for  n  =  2,4,6 . then  u  =  -1. 

i  t  n 
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If  (x.,x  )  belongs  to  the  y  curve  for  n  =  -2,  -4,  -6 .  then  u  =  +l. 

i  z  n 

If  (x.,xj  belongs  to  the  y  curve,  then  u  =  +1. 

If  (x1(x2)  belongs  to  the  yQ_  curve,  then  u  =  -1. 

For  all  other  points  in  the  state  plane  u  =  0. 

The  proof  of  this  theorem  is  very  similar  to  the  proof  presented  in  Ref.  5  and 
is,  therefore,  omitted  for  the  sake  of  brevity.  Figure  6  illustrates  an  engineering 
realization  of  the  control  law  provided  by  Theorem  2.  The  signals  hn,  for  n  =  0,  +2, 
+  4, . . . ,  are  generated  as  explained  before.  If  the  state  point  belongs  to  the  y  curve, 
then  the  h.  signal  is  zero.  In  Fig.  6  the  signals  hQ  and  are  used  to  generate  the 
two  signals  h  and  h  if  h  =0,  then  (x.,x0)  belongs  to  the  y  curve  and  if 

h  =0,  then  (x.,x„)  belongs  to  the  y  curve.  Each  signal  h.,  i  =  +  2,  +4 . and 

hQ  and  h^  activate  normally  closed  relays  so  that  a  voltage  of  -2  volts  appears  on 

the  common  terminal  if  the  state  point  is  on  the  yQ_,  y  ,  y 4 . curves  and  a  voltage 

of  +2  volts  appears  only  if  the  state  belongs  to  the  y^_,  y  ^,  y  ^ .  curves.  For 

any  other  state  the  relays  are  open  and  no  voltage  appears  at  the  terminal.  Thus,  the 
signals  at  the  terminal  can  be  used  to  open  or  close  the  reaction  jets  on  the  body  and  to 
deliver  the  correct  thrust  for  minimum  fuel  operation. 

If  the  bound  on  the  magnitude  of  the  angular  velocity  x2  is  known,  the  computer 
of  Fig.  6  would  again  have  a  finite  number  of  stages. 


5.  DISCUSSION  OF  THE  RESULTS 


In  the  two  previous  sections  the  time  -optimal  and  the  fuel  -optimal  control  laws 
were  derived  and  the  corresponding  controllers  were  designed.  In  this  section,  equations 
for  the  response  time  and  for  the  consumed  fuel  required  by  the  time -optimal  and  by  the 
fuel -optimal  systems  are  presented. 

Let  t *  be  the  minimum  time  to  force  a  given  state  (Xj,x2)  to  (0,0).  The 
minimum  time  t*  is  determined  from  the  formulae  below: 


x2  -  2  Jx1  +  i  x^  -  m  7r  if  (x1>x2)eRm  m+1  and  m  =  0,  2,4, 6, . . . 


-x2  +  2  J-x1  +ix2  +  m  tt  if  (x1,x2)eRm  m+1  and  m  =  1, 3,  5, 7, . . . 

f*  i  , - j - 

x2  +  2  VXj  +  ^x2  -  mir  if  (x][1x2)eRm  m+1  and  m  =  -2,  -4,  -6, . . . 

-x.  -  2  V-x.+ixi?  +  mir  if  (x.,x  )eR _ ,.  and  m  =  -1,  -3,  -5, . . . 

2  12  12  m,  m+l 


(40) 


The  fuel  F*  consumed  by  the  time -optimal  system  is  given  by 


F*  =  kt*  (41) 

since  the  fuel  is  measured  by  Eq.  (36)  and  for  die  time  -.optimal  system  |u(t)|  =  1  and 
the  response  time  is  t*. 

Let  T  be  the  response  time  required  by  the  fuel  -optimal  system.  The  response 
time  T,  as  a  function  of  x^  and  x2,  is  given  below: 

If 

x2  >  0  (42) 

find  the  value  of  m,  m  =  0, 2, 4, . . . ,  such  that 


^27r(m  -  1)  —  x2  ^  ^27r(m  +  1)  ■  (43) 

Using  the  value  of  the  determined  m  the  response  time  is 
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1  r  i  2 

—  x-  +  mn  -  x 
*2  l 


j}  if  Xj  —  m^r  -  i  : 


2  2  i  2 

~  {i  *2  +  (m+2)7r  -  Xj}  if  x1  >  inff  -  £  x2 


V° 

find  the  value  of  m,  m  =  0,  -2,  -4 .  such  that 


f-2ir(m-l)  -  x2  -  -*^-2ir(m+l) 


Using  the  value  of  m  the  response  time  is 


~  {-ixSmff  -  x.}  if  x.^mtr+ix^ 
X„  2  1  1  4 


|  ~  {-ix2  +  (m-2)7r  -  Xj}  if  x1  <  mir  +  £  x^  J 

Let  Fq  denote  the  fuel  consumed  by  the  fuel  -optimal  system.  The  fuel  Fq  is 


given  by 


Fo  ■  k|x2l 


and,  of  course,  Fq  represents  the  minimum  fuel  required  to  force  (x^.x^)  to  (0,0). 
From  Eqs.  (42)  and  (45)  one  finds 


lim  t  =  00  , 

xj  -0 

(48) 

2 

lim  F  =  0 

(49) 

X 

to 

1 

O 

o 

The  large  response  times  required  by  the  fuel  -optimal  system  for  s-nall  values  of  x^  are 
undesirable  from  an  engineering  point  of  view.  There  exist  techniques  for  improving 
the  time  response  of  such  a  system  with  an  increase  in  the  amount  of  fuel  required. 

Such  techniques  have  been  reported  in  Ref.  8  and  they  can  be  used  to  improve  the 


t 


response  of  the  fuel -optimal  system. 

Another  disadvantage  of  the  fuel -optimal  system  and  its  engineering  realization 
of  Fig.  6  is  that  switching  from  u  =  0  to  u  =  +1  or  -1  occurs  only  at  the  curves. 
This  means  that  a  slight  malfunction  of  the  relays  in  Fig.  6  might  cause  the  control  to 
remain  at  u  =  0  and  thus  impair  the  time -response  of  the  system,  but  not  affect  the 
consumed  fuel.  An  engineering  cure  for  the  phenomenon  is  to  use  relays  with  some 
dead-zone  or  hysteresis  so  as  to  "widen"  the  yn  curves  in  the  plane  so  that  the 
transition  from  "off'  to  "on"  occurs  in  finite  areas  rather  than  on  curves. 


6.  CONCLUSIONS 


The  time -optimal  and  the  fuel -optimal  control  laws  have  been  derived  for  the 
attitude  control  of  a  satellite.  The  necessary  nonlinear  controllers  were  designed  and 
their  operation  explained. 
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APPENDIX 


Proof  of  Theorem  I 

We  shall  use  the  Maximum  Principle  (see  Ref.  4)  to  prove  Theorem  1.  The 
time -optimal  Hamiltonian  for  the  system  of  Eq.  (8)  is 


H  =  1  +  x2(t)p2(t)  +  u(t)p2(t)  , 

(49) 

The  minimum  of  H 

occurs  at 

u(t)  =  -  sgn  {p2(t)} 

(50) 

But 

pi " '  ta,(t)  ■ 0 

(51) 

P2(t)"  8x2(t)  “  Pl(t) 

(52) 

Let 

P^O)  «  ;  P2(0)  a  W2 

(53) 

Then 

P2(t)  a  *2  -  *jt 

(54) 

Therefore 

u(t)  =  -  sgn  U2  -  ^t} 

(55) 

Equation  (55)  means  that  only  the  four  control  sequences  (see  Definition  2)  can  be 
candidates  for  time -optimal  control 


{-1}.  {t-l.-l},  {-l.+l}  .  (56) 

We  shall  first  prove  that  if  (x^x^cy  ,  then  u  =  -Hi  is  the  time -optimal  control,  i.  e. 
the  sequence  {+l}  is  time-optimal.  Consider,  in  Fig.  7,  the  point  Aey  .  By 

Ot 

definition,  the  control  sequence  {*-l}  transfers  A  to  O  along  AO  (i.  e.  along  the  y 

of 

curve).  From  the  remaining  sequences  of  Eq.  (56)  only  die  sequence  {-l,+l}  can  force 
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A  to  O  along  the  trajectory  ABCC'AO,  provided  the  transition  from  *1  to  +1  occurs 
at  Bey  But  the  time  along  ABCC'AO  Is  greater  than  the  time  along  AO  (because 
AO  Is  a  portion  of  ABCC'AO),  hence  the  sequence  {-l,+l}  is  not  time -optimal,  and, 
hence  {fl}  is  time -optimal.  Q.  E.  D.  Using  similar  reasoning  we  can  prove  that 
,  then  u  =  -1  in  the  time -optimal  control. 

Now  suppose  the  initial  state  (x^x^eRq  j.  The  only  control  sequences  that 
can  force  (Xj.x^  to  (0,0)  are  {-l,+l}  and  {fl.-l}.  The  sequence  {-l,+l}  is  the 
time -optimal  one;  this  follows  from  the  arguments  presented  in  Section  3  where  the  Yj 
curve  was  defined  (that  is,  the  arguments  following  Definition  2  up  to  Preposition  3). 

For  all  the  other  states  the  procedure  is  quite  similar.  First  one  finds  the 
control  sequences  which  force  (Xj.x^  to  (0, 0)  and  then  by  direct  computation  one 
can  prove  all  the  statements  of  Theorem  1. 
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FIGURE  CAPTIONS 


Fig.  1 
Fig.  2 
Fig.  3 
Fig.  4 

Fig.  5 

Fig.  6 
Fig.  7 


The  physical  system. 

The  y  curves  and  the  R  , ,  regions, 
n  n,  n+1  * 

The  trajectories  XZAA'O  and  XYO. 

An  engineering  realization  of  the  time  -optimal  control  law  provided 
by  Theorem  1. 

The  y  curves  (n  =  0,  +2,  +  4, . . . ).  ABO  and  CDD'EFF'O  are 
n  - 

examples  of  fuel -optimal  trajectories. 

An  engineering  realization  of  die  fuel  -optimal  control  law  provided 
by  Theorem  2. 

The  trajectories  which  arise  from  different  control  sequences. 
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